Overview of the Patent Machine Translation Task at the NTCIR-10 Workshop

نویسندگان

  • Isao Goto
  • Ka-Po Chow
  • Bin Lu
  • Eiichiro Sumita
  • Benjamin Ka-Yin T'sou
چکیده

This paper gives an overview of the Patent Machine Translation Task (PatentMT) at NTCIR-10 by describing its evaluation methods, test collections, and evaluation results. We organized three patent machine translation subtasks: Chinese to English, Japanese to English, and English to Japanese. For these subtasks, we provided large-scale test collections, including training data, development data and test data. In total, 21 research groups participated and 144 runs were submitted. We performed four types of evaluations: Intrinsic Evaluation (IE), Patent Examination Evaluation (PEE), Chronological Evaluation (ChE), and Multilingual Evaluation (ME). We conducted human evaluations for IE and PEE.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of the Patent Machine Translation Task at the NTCIR-9 Workshop

This paper gives an overview of the Patent Machine Translation Task (PatentMT) at NTCIR-9 by describing the test collection, evaluation methods, and evaluation results. We organized three patent machine translation subtasks: Chinese to English, Japanese to English, and English to Japanese. For these subtasks, we provided large-scale test collections, including training data, development data an...

متن کامل

Overview of the Patent Translation Task at the NTCIR-7 Workshop

To aid research and development in machine translation, we have produced a test collection for Japanese/English machine translation and performed the Patent Translation Task at the Seventh NTCIR Workshop. To obtain a parallel corpus, we extracted patent documents for the same or related inventions published in Japan and the United States. Our test collection includes approximately 2 000 000 sen...

متن کامل

Overview of the Eighth NTCIR Workshop

For the Eighth NTCIR Workshop (NTCIR-8), we selected and organized seven research areas as “tasks” to investigate, test and benchmark newly constructed test collections. These areas are Complex Cross-Lingual Question Answering (CCLQA), Information Retrieval for Question Answering (IR4QA), Geographic and Temporal Search (GeoTime), Multilingual Opinion Analysis (MOAT), Patent Machine Translation ...

متن کامل

ISTIC Statistical Machine Translation System for Patent machine translation in NTCIR-9

This paper describes statistical machine translation system of ISTIC used in the evaluation campaign of the patent machine translation task at NTCIR-9. In this year's evaluation, we participated in patent machine translation task for ChineseEnglish. Here we mainly describe the overview of the system, the primary modules, the key techniques and the evaluation results.

متن کامل

Overview of the Patent Translation Task at the NTCIR-8 Workshop

Motivation Motivation l / • Systematic evaluation in NLP / IR is crucial – Large Test Collections are needed • We have produced test collections for p patent retrieval at NTCIR since 2001 – What is the next task for patent What is the next task for patent information? 2 History of Patent IR at NTCIR y • NTCIR-3 (2001-2002) T h l 2 years of JPO patent li ti * – Technology survey • Applied conven...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013